First Iteration Policies for Admission Control in Multiaccess Networks
نویسندگان
چکیده
This work explores approximate methods to solve Markov decision processes for large systems through Policy iteration.Two methods, one using an embedded discrete time Markov chain and the other using time scale separation, are defined and compared with the solution obtained using traditional Policy iteration. First step solutions are found and compared for a radio resource management problem with two radio access technologies and two service types. The approaches proposed considerably reduce the computational cost while closely approximate the optimal solution. The solutions are extended by increasing the number of steps of policy iteration and results show that it is possible to reach the performance of the optimal policy when several steps are required reducing the computational cost.
منابع مشابه
Decentralized Dynamic Control of a Multiaccess Broadcast Channel
Absfrucf -Retransmission policies are presented for the decentralized control of a multiaccess packet-switched broadcast channel. The policies have a simple recurSiYe form yielding a Markov description of the system. Finite average delay is achieved for an infinite-population Poisson arrival model for any rate X < e ’ . It is proposed that the goal of retransmission policies should be to...
متن کاملEfficient Structured Policies for Admission Control in Heterogeneous Wireless Networks
In the near future, demand for Heterogeneous Wireless Networking (HWN) is expected to to increase. QoS provisioning in these networks is a challenging issue considering the diversity in wireless networking technologies and the existence of mobile users with different communication requirements. In HWNs with their increased complexity, “the curse of dimensionality” problem makes it impractical t...
متن کاملA Two-Threshold Guard Channel Scheme for Minimizing Blocking Probability in Communication Networks
In this paper, we consider the call admission problem in cellular network with two classes of voice users. In the first part of paper, we introduce a two-threshold guard channel policy and study its limiting behavior under the stationary traffic. Then we give an algorithm for finding the optimal number of guard channels. In the second part of this paper, we give an algorithm, which minimizes th...
متن کاملGuaranteed Quality-of-Service Wireless Medium Access by Packet-by-Packet Generalized Processor Shari - Communications, 1998. ICC 98. Conference Record.1998 IEEE International Conference on
To serve traffic with different characteristics and service requirements in multimedia wireless packet networks] we propose a multiaccess methodology PGPS/RAP with guaranteed quality-of-service (40s) to serve constant-bit-rate (CBR), variable-bit-rate (VBR), and available bit rate (ABR) traffic sources. This multiaccess methodology without any specific frame concept can guarantee worst-case del...
متن کاملEfficient exploration through active learning for value function approximation in reinforcement learning
Appropriately designing sampling policies is highly important for obtaining better control policies in reinforcement learning. In this paper, we first show that the least-squares policy iteration (LSPI) framework allows us to employ statistical active learning methods for linear regression. Then we propose a design method of good sampling policies for efficient exploration, which is particularl...
متن کامل